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ABSTRACT 

The study of stellar populations in galaxies is entering a new era with the availability 
of large and high quality databases of both observed galactic spectra and state-of-the- 
art evolutionary synthesis models. In this paper we investigate the power of spectral 
OO . synthesis as a mean to estimate physical properties of galaxies. Spectral synthesis is 

\l ' nothing more than the decomposition of an observed spectrum in terms of a superposi- 

tion of a base of simple stellar populations of various ages and metallicities, producing 
as output the star-formation and chemical histories of a galaxy, its extinction and 
velocity dispersion. Our implementation of this method uses the Bruzual & Chariot 
(2003) models and observed spectra in the 3650-8000 A range. The reliability of this 
■ approach is studied by three different means: (1) simulations, (2) comparison with 

previous work based on a different technique, and (3) analysis of the consistency of 
Q . results obtained for a sample of galaxies from the Sloan Digital Sky Survey (SDSS). 

• We find that spectral synthesis provides reliable physical parameters as long as 

\ one does not attempt a very detailed description of the star-formation and chemical 

d . histories. Robust and physically interesting parameters are obtained by combining the 

(individually uncertain) strengths of each simple stellar population in the base. In 
particular, we show that besides providing excellent fits to observed galaxy spectra, 
>^ ' this method is able to recover useful information on the distributions of stellar ages 

^ ' and, more importantly, stellar metallicities. Stellar masses, velocity dispersion and 

extinction are also found to be accurately retrieved for realistic signal-to-noise ratios. 

We apply this synthesis method to a volume limited sample of 50362 galaxies 
from the SDSS Data Release 2, producing a catalog of stellar population properties. 
Emission lines are also studied, their measurement being performed after subtracting 
the computed starlight spectrum from the observed one. A comparison with recent 
estimates of both observed and physical properties of these galaxies obtained by other 
groups shows good qualitative and quantitative agreement, despite substantial differ- 
ences in the method of analysis. The confidence in the method is further strengthened 
by several empirical and astrophysically reasonable correlations between synthesis re- 
sults and independent quantities. For instance, we report the existence of strong corre- 
lations between stellar and nebular mctallicites, stellar and nebular extinctions, mean 
stellar age and equivalent width of Ha and 4000 A break, and between stellar mass 
and velocity dispersion. 

Key vifords: galaxies: stellar content - galaxies: statistics - galaxies: fundamental 
parameters - galaxies: evolution 
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1 INTRODUCTION 

Galaxy spectra encode information on the age and metal- 
licity distributions of the constituent stars, which in turn 
reflect the star-formation and chemical histories of the galax- 
ies. Retrieving this information from observational data in a 
reliable way is crucial for a deeper understanding of galaxy 
formation and evolution. 

The mapping of observed onto physical properties of 
galaxies has been a major topic of research for over a gener- 
ation of astronomers since the pioneering works of Morgan 
(1956), Wood (1966) and Faber (1972) on the one hand, and 
Tinsley (1968) and Spinrad & Taylor (1972) on the other. 
The first group of authors introduced the so-called empiri- 
cal population synthesis methods, which aim at reproducing 
a set of observations of a given galaxy by means of a lin- 
ear combination of simpler systems of known characteristics, 
like individual stars or chemically homogeneous and coeval 
groups of stars. Bica (1988), Pelat (1997), Cid Fernandes 
et al. (2001), Moultaka etal. (2004) are examples of modern 
studies following this approach. The second group pioneered 
the so-called evolutionary population synthesis methods, 
which compare galaxy data with models that follow the time 
evolution of an entire stellar system by combining libraries of 
evolutionary tracks and stellar spectra with prescriptions for 
the initial mass function (IMF), star formation and chem- 
ical histories. This approach has enjoyed more widespread 
use in recent years, e.g., Arimoto & Yoshii (1987), Guider- 
doni & Rocca-Volmerange (1987), Bressan et al. (1994), Fioc 
& Rocca-Volmerange (1997), Vazdekis (1999), Bruzual & 
Chariot (2003, hereafter BC03), Le Borgne etal. (2004), 
among many others (see Cardiel et al. 2003 for a large set 
of references). In short, empirical synthesis relies on nature 
for its basic ingredients, whereas evolutionary synthesis re- 
lies mostly on models. However, since models are made and 
calibrated to mimic nature, this difference is gradually van- 
ishing as models improve. 

Besides different methodologies, there are also differ- 
ences in what type of data is actually modeled. Colours (e.g.. 
Wood 1966), absorption line equivalent widths or spectral 
indices (e.g., Worthey 1994; Kauffmann etal. 2003, here- 
after K03) and emission features, both stellar (Leitherer, 
Robert & Heckman 1995; Schaerer & Vacca 1998) and neb- 
ular (Mas-Hesse & Kunth 1991; Kewley et al. 2001), have aU 
been used in stellar population synthesis. More recently, the 
full spectral information has been incorporated in the mod- 
eling process, both including (Chariot & Longhetti 2001), 
and excluding emission lines (Vazdekis & Arimoto 1999; 
Reichardt, Jimenez & Heavens 2001; Cid Fernandes etal. 
2004a, hereafter CF04). 

Recovering the stellar content of a galaxy from its ob- 
served integrated spectrum is not an easy task, as can be 
deduced from the amount of work devoted to this topic 
over the past ~ half century. The situation is however much 
more favorable nowadays. Huge observational and theoret- 
ical efforts in the past few years have produced large sets 
of high quality spectra of stars (e.g., Prugniel & Soubiran 
2001; Le Borgne etal. 2003; Bertone etal. 2004; Gonzalez 
Delgado etal. 2004). These libraries are being implemented 
in a new generation of evolutionary synthesis models, al- 
lowing the prediction of galaxy spectra with an unprece- 
dented level of detail (Vazdekis 1999; BC03, Le Borgne etal. 



2004). At the same time, galaxy spectra are now more abun- 
dant than ever (Loveday etal. 1996; York etal. 2000). The 
Sloan Digital Sky Survey (SDSS), in particular, is provid- 
ing a homogeneous data base of hundreds of thousands of 
galaxy spectra in the 3800-9200 A range, with a resolution 
of A/AA ~ 1800 (York et al. 2000; Stoughton et al. 2002; 
Abazajian et al. 2003, 2004). This enormous amount of high 
quality data will undoubtedly be at the heart of tremendous 
progress in our understanding of galaxy constitution, for- 
mation and evolution. Indeed, significant steps in this direc- 
tion have recently been made (K03; Brinchmann et al. 2004, 
Tremonti et al. 2004; Heavens et al. 2004; Panter, Heavens & 
Jimenez 2004). 

In order to take advantage of the recent progress in evo- 
lutionary synthesis to analyze data sets such as the SDSS, a 
methodology must be set up to go from the observed spectra 
to physical properties of galaxies. In this paper we discuss 
one possible method to achieve this goal. The method is 
based on fitting an observed spectrum with a linear combi- 
nation of simple theoretical stellar populations (coeval and 
chemically homogeneous) computed with evolutionary syn- 
thesis models at the same spectral resolution as that of the 
SDSS. 

Our goal here is to demonstrate that, besides provid- 
ing excellent starlight templates to aid emission line studies, 
spectral synthesis recovers reliable stellar population prop- 
erties out of galaxy spectra of realistic quality. We show 
that this simple method provides robust information on the 
stellar age (f*) and stellar metallicities (Z*) distributions, 
as well as on the extinction, velocity dispersion and stellar 
mass. The ability to recover information on Z^, is particu- 
larly welcome, given that stellar metallicities are notoriously 
more difficult to assess than other properties. In order to 
reach this goal we follow: (1) a priori arguments, based on 
simulations; (2) comparisons with independent work based 
on a different method, and (3) an a posteriori empirical anal- 
ysis of the consistency of results obtained for a large sample 
of SDSS galaxies. Other papers in this series on the Semi- 
Empirical Analysis of Galaxies (SEAGal) will explore vari- 
ous astrophysical implications of the results obtained with 
this method. 

This paper is organized as follows. Section|5|presents an 
overview of our synthesis method and simulations designed 
to test it and evaluate the uncertainties involved. The discus- 
sion is focused on how to use the synthesis to derive robust 
estimators of physically interesting stellar population prop- 
erties. Section |21 defines a volume limited sample of SDSS 
galaxies and presents the results of the synthesis of their 
spectra, along with measurements of emission lines. In Sec- 
tion|3]we compare our results to those recently published by 
Brinchmann etal. (2004). Stellar population and emission 
line properties are used in Section |K| to investigate whether 
the synthesis produces astrophysically plausible results. Fi- 
nally, Section |S| summarizes our main findings. 

2 SPECTRAL SYNTHESIS 
2.1 Method 

Our synthesis code, which we call STARLIGHT, was first 
discussed in CF04 (see also Cid Fernandes et al. 2004b and 
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Garcia- Rissman et al. 2005 for different applications of the 
same code). STARLIGHT mixes computational techniques 
originally developed for empirical population synthesis with 
ingredients from evolutionary synthesis models. Briefly, we 
fit an observed spectrum Ox with a combination of Af* Sim- 
ple Stellar Populations (SSP) from the evolutionary syn- 
thesis models of BC03. Extinction is modeled as due to 
foreground dust, and parametrized by the V-band extinc- 
tion Av- The Galactic extinction law of Cardelli, Clayton 
& Mathis (1989) with Rv ~ 3.1 is adopted. Line of sight 
stellar motions are modeled by a Gaussian distribution G 
centered at velocity and with dispersion a*. With these 
assumptions the model spectrum is given by 



Mx = Mxo 



E 



Xjbj,xrx 



(1) 



where bj^x is the spectrum of the j'^ SSP normalized at 
Ao, rx = 10"''"f^^"'*^o) is the reddening term, Mxo is the 
synthetic flux at the normalization wavelength, x is the pop- 
ulation vector and Cg) denotes the convolution operator. Each 
component Xj {j — 1 . . . N^) represents the fractional con- 
tribution of the SSP with age tj and metallicity^ Zj to the 
model flux at Ao. The base components can be equivalently 
expressed as a mass fractions vector /I. In this work we adopt 
a base with A* = 45 SSPs, encompassing 15 ages between 
10'' and 1.3 x 10^" yr and 3 metallicities: Z = 0.2, 1 and 2.5 
Zq. Their spectra, shown in Fig.0 were computed with the 
STELIB library (Le Borgne et al. 2003), Padova 1994 tracks, 
and Chabrier (2003) IMF (see BC03 for details). 

The fit is carried out with a simulated annealing plus 
Metropolis scheme which searches for the minimum = 
X^A [(^^^ ~ "^-^I'^i where w^^ is the error in Ox- Regions 
around emission lines, bad pixels or sky residuals are masked 
out by setting wx ~ 0. Pixels which deviate by more than 
3 times the rms between Ox and an initial estimate of Mx 
are also given zero weight. 

The minimization consists of a series of Nm ~ 6 
likelihood-guided Metropolis explorations of the parame- 
ter space. From each iteration to the next we increase the 
Wx weights geometrically (which corresponds to a decrease 
in the "temperature" in statistical mechanics terms; e.g., 
MacKay 2003). The step-size in each parameter is concomi- 
tantly decreased and the number of steps is increased. This 
scheme gradually focuses on the most likely region in pa- 
rameter space, avoiding (through the logic of the Metropo- 
lis algorithm) trapping onto local minima. After completion, 
the whole fit is fine tuned repeating the full loop excluding 
all Xj = components. An important difference (from the 
computational point of view) with respect to the code in 
CF04 is that we now perform a series expansion of the rx 
extinction factor, which allows a much faster computation 
of X ■ Naturally, there are a number of technical parameters 
in this complex fitting algorithm, which we have optimized 
by means of extensive simulations (see CF04). At any rate, 
results reported here are robust with respect to variations 
in these technical parameters. 



^ In this paper we follow the convention used in stellar evolution 
studies, which define stellar metallicities in terms of the fraction 
of mass in metals. In this system the Sun has = 0.02. 



Figs, m and |3 illustrate the spectral fits obtained for 
two galaxies drawn from the SDSS database. The top-left 
panel shows the observed spectrum (thin line) and the model 
(thick), as well as the error spectrum (dashed). The bottom- 
left panel shows the Ox — Mx residual spectrum, while the 
panels in the right summarize the derived star-formation 
history encoded in the age-binned population vector. These 
examples, along with those in K03 and CF04, demonstrate 
that this simple method is capable of reproducing real galaxy 
spectra to an exceUent degree of accuracy. 

An important application of the synthesis is to measure 
emission lines from the residual spectrum, as done by K03 
(see also Section [3.311 . Another, of course, is to infer stellar 
population properties from the fit parameters. This was not 
the approach followed by K03 and their subsequent papers, 
who prefer to derive stellar population properties mainly 
from the _D„(4000) versus H5a diagram. Our central goal 
here is to investigate whether spectral synthesis can also re- 
cover reliable stellar population properties. In the remainder 
of this section we address this issue by means of simulations. 

2.2 Robust description of the synthesis results 

The existence of multiple solutions is an old known prob- 
lem in stellar population synthesis. This multiplicity arises 
from a combination of three factors: (1) algebraic degeneracy 
(number of unknowns larger than number of observables) , 
(2) intrinsic degeneracies of stellar populations and (3) mea- 
surement uncertainties. Unlike in methods which synthesize 
only a handful of spectral indices, algebraic degeneracy is 
not a problem for the method outlined above, as the number 
of Ox points in any decently sampled spectrum far exceeds 
the number of parameters even for large bases. Similarly, by 
modeling the whole spectrum one should be able to allevi- 
ate degeneracies in spectral indices associated with different 
stellar populations (Jimenez etal. 2004). Measurement er- 
rors, however, are still a problem. A corollary of these com- 
plications is that even superb spectral fits as those shown in 
Figs. 121 and |3] do not guarantee that the resulting parameter 
estimates are trustworthy. 

This brief discussion illustrates the need to assess the 
degree to which one can trust the parameters involved in the 
fit before using them to infer stellar populations properties. 
As posed above, the spectral fits involve A* + 3 parameters: 
A* — 1 of the X components (one degree of freedom is re- 
moved by the normalization constraint), Mxg, Av and the 
two kinematical parameters, and a*. The reliability of pa- 
rameter estimation is best studied by means of simulations 
which feed the code with spectra generated with known pa- 
rameters, add noise, and then examine the correspondence 
between input and output values. 

CF04 performed this kind of simulation for a A* = 20 
base and spectra in the 3500-5200 A interval. Their analy- 
sis concentrated on how well the method recovers Xinput for 
varying degrees of noise. The main results of that study are: 
(1) In the absence of noise, the method recovers all compo- 
nents of a; to a high degree of accuracy. (2) In the presence 
of noise, however, the individual output Xj fractions may 
deviate drastically from the input values. Essentially, what 
happens is that noise washes away the differences between 
spectrally similar components, so it becomes impossible to 
distinguish them, and the code splits Xj among neighboring 
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Figure 1. Spectra of the 45 SSPs used in tlie spectral synthesis (from BC03). The base comprises 3 different metallicities, Z = 0.2, 1 
and 2.5 Zq, and 15 ages: From top to bottom, t = 0.001, 0.00316, 0.00501, 0.01, 0.02512, 0.04, 0.10152, 0.28612, 0.64054, 0.90479, 1.434, 
2.5, 5, 11 and 13 Gyr. All SSPs arc normalized to 1 M0 at t = 0. 



components in spectral space. This is a common problem 
in population synthesis (e.g., Panter etal. 2004; Tremonti 
2003; Cid Fernandes etal. 2001). 

Once we have identified the origin of the problem, the 
remedy to fix it is evident: bin-over spectrally similar com- 
ponents. In other words, instead of attempting a fine graded 
description of stellar populations mixtures in terms of many 
ages and metallicities, it is much better to "marginalize 
over the details" and work with a coarser but more robust 
description based on combined Xj fractions. As shown by 
CF04, condensed versions of the population vector which 



project its A'^* components onto just a few physically inter- 
esting axes, yield very reliable results. 

Our approach and objectives are conceptually similar 
to those of the MOPED code of Heavens, Jimenez & La- 
hav (2000; see also Reichardt etal. 2001, Panter etal. 2004). 
Operationally, however, the two codes differ. MOPED com- 
presses data on input by degrading the SDSS spectral reso- 
lution to 20 A and performing weighted linear combinations 
of Ox, reducing its dimension to one datum per output pa- 
rameter while at the same time minimizing the loss of infor- 
mation. STARLIGHT, on the other hand, works with the 
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Figure 2. Spectral synthesis of an early type SDSS galaxy. Top-left: Observed (thin, green line), model (thick, black ) and error spectra 
(dashed, blue). Points in the bottom indicate bad pixels (as given by the SDSS flag) or emission lines windows, both of which were 
masked out in the fits. Bottom-left: Residual spectrum. Masked regions are plotted with a thin, green line. Right: Flux (top) and mass 
(bottom) fractions as a function of age. Some of the derived properties are listed in the top-right. Xx the reduced x'^i and Ax is the 
mean relative difference between model and observed spectra; S/N refers to the region around Aq = 4020 A. 



spectrum at its full resolution and an overdimensioned set 
of parameters (i.e., large A''*), which we compress on the de- 
scription of the output. The disadvantage of our approach 
is computational. Indeed, it is described as a "slow" and 
"brute force" method by Panter etal. (2004). This, how- 
ever, is not a severe problem given the abundance of CPUs 
nowadays (STARLIGHT takes about 4 minutes per galaxy 
on a 2 GHz Linux- workstation) . Furthermore, there is plenty 
of room for improvement in the efficiency of the algorithm. 



for instance, by implementing a smarter exploration of the 
parameter space (Slosar & Hobson 2003), to the point that 
computational constraints could soon become a minor con- 
cern. An advantage of STARLIGHT is that it also measures 
kinematical parameters (mainly o"*) and provides a high res- 
olution template spectrum. 
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Figure 3. As Fig.^lbut for a late-type galaxy. 



2.2.1 Simulations 

We have carried out new simulations designed to test the 
method and investigate which combinations of the param- 
eters provide robust results. These simulations differ from 
those in CF04 in three main aspects: (1) the spectral range 
is now 3650-8000 A; (2) the new base is larger; (3) a more 
realistic error spectrum was used. A further difference is that 
we use a larger number of iterations, partly because we have 
more base elements than CF04 and partly because the code 
is now over 200 times faster. At each step in the annealing 
scheme, the number of steps per parameter is set to 2L/e, 
where L is the length of the allowed range for the parameter 



(e.g., 1/ = 1 for Xj fractions, which range from to 1), and 
e is the step-size (see CF04 for a more detailed description 
of these technical aspects). This insures that each param- 
eter can in principle random-walk twice its whole allowed 
range, which is an adequate criterion for Metropolis sam- 
pling (e.g., MacKay 2003). Furthermore, there are 12 stages 
in the annealing schedule. Overall, over 10^ combinations of 
parameters are sampled for each galaxy. 

Several sets of simulations were performed. Given our 
interest in modeling SDSS galaxies, here we focus on sim- 
ulations tailored to match the characteristics of this data 
set. Test galaxies were built from the average x, Ay and a* 
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within 65 boxes in the mean stellar age versus mean stel- 
lar metallicity plane obtained for the sample described in 
Sectional Each spectrum covers the 3650-8000 A range in 
steps of 1 A, the same sampling of the BC03 models. We 
generate 20 perturbed versions of each synthetic spectrum 
for each of 5 levels of noise: S/N = 5, 10, 15, 20 and 30, 
where S/N is the signal-to-noise ratio per A in the region 
around Ao = 4020 A. Unlike in CF04, who used a flat er- 
ror spectrum, the error at each A was assumed to follow a 
Gaussian distribution with amplitude obtained by scaling a 
normalized mean SDSS error spectrum to yield the desired 
S/N at Ao- This error spectrum decreases by a factor of ~ 3 
from 3650 to 6200 A and then increases again by ~ 1.5 to- 
wards 8000 A. Finally, as for the actual data fits, we mask 
points around [O ii]AA3726,3729, [Ne ni]A3869, He, R5, H7, 
up, [O ni]AA4959,5007, [He i]A5876, NaDA5890, [O i]A6300, 
[N ii]AA6548,6583, Ha, [S ii]AA6717,6731. 

These new simulations confirm that the individual com- 
ponents of X are very uncertain, so we skip a detailed com- 
parison between 5input and ^output and jump straight to 
results based on more robust descriptions of the synthesis 
output. 
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2.2.2 Condensed population vector 

A coarse but robust description of the star-formation history 
of a galaxy may be obtained by binning x onto "Young" 
{tj < 10** yr), "Intermediate-age" (10* < tj < 10^ yr), 
and "Old" {tj > 10^ yr) components (xy, xj and xo re- 
spectively). These age-ranges were defined on the basis of 
the simulations, by seeking which combinations of Xj's pro- 
duce smaller input — output residuals. Table and figure 
|l|show that these 3 components are very well recovered by 
the method, with uncertainties smaller than Axy ~ 0.05, 
Axi = 0.1, and Axo = 0.1 for S/N > 10. 



2.2.3 Mass, extinction and velocity dispersion 

Fig. 01 also shows the input versus output values of Ay, f* 
and the stellar mass M*. The latter is not an explicit input 
parameter of the models, but may be computed from /2 and 
the M-t/Lxg ratio of the different populations in the base. 
The uncertainties in these parameters are AAv < 0.05 mag, 
AlogAf* < 0.1 dex and Act* < 12 kms"^ for S/N > 10. 
(Table 0. 



2.2.4 Mean stellar age 

If one had to choose a single parameter to characterize the 
stellar population mixture of a galaxy, the option would cer- 
tainly be its mean age. We define two versions of mean stel- 
lar age (the logarithm of the age, actually), one weighted by 
light 



< logf* >L=^Xj logtj 



J = l 



and another weighted by stellar mass 



< logt* >M= logij 



(2) 



(3) 
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Figure 4. Input versus output synthesis parameters for simula- 
tions with S/N = 15 at Aq = 4020 A. The units are percentages 
for the condensed population vector [xy, xj, xq), magnitudes for 
Ay, kms"'^ for tr* and arbitrary for M*. The output parameters 
are represented by ±1 sigma error bars centered on the mean val- 
ues obtained by fitting 20 realizations of each of 65 test galaxies. 



Note that, by construction, both definitions are limited to 
the 1 Myr-13 Gyr range spanned by the base. The mass 
weighted mean age is in principle more physical, but, be- 
cause of the non-constant M/L of stars, it has a much less 
direct relation with the observed spectrum than < log > l . 

Fig. shows the input against output < log £* >_l for 
simulations with S/N — 10 and 20. The plots show that 
the mean age is a very robust quantity. The rms difference 
between input and output < logt* >l values is < 0.08 dex 
for S/N > 10, and < 0.14 dex for < logi* >m (Table Q. 
Although the uncertainties of < logt* >l and < logf* >m 
are comparable in absolute terms, the latter index spans a 
smaller dynamical range (because of the large M/L ratio 
of old populations), so in practice < log£t >l is the more 
useful of the two indices. 

Given that the mean stellar age is so well recovered by 
the method one might attempt more detailed descriptions 
of the star-formation history involving, say, higher moments 
of the age distribution. For instance, 
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Summary of parameter uncertainties 
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4.63 




•j.ui _i_ Kj.yjtj 


0.00 ± 0.05 


0.01 


-|- 


0.03 


0.01 


-|- 


0.03 


0.00 


-|- 


0.02 




f) 24 + 17 7'^ 


-0.11 ± 8.55 


0.07 


-|- 


5.92 


0.00 


-|- 


4.23 


0.05 


-|- 


2.81 




Q 1 9 _|_ 94 QO 


-1.40 ± 12.36 


-1.01 


-|- 


7.71 


-0.75 


-|- 


5.73 


-0.57 


-|- 


3.78 


log Mi, 


0.01 ± 0.11 


-0.01 ± 0.08 


-0.01 


± 


0.06 


-0.01 


± 


0.05 


-0.02 


± 


0.04 


< logU >L 


-0.01 ± 0.14 


0.01 ± 0.08 


0.00 


± 


0.06 


0.01 


± 


0.05 


0.01 


± 


0.04 


< logU >M 


-0.03 ± 0.20 


-0.04 ± 0.14 


-0.03 


± 


0.11 


-0.04 


± 


0.10 


-0.04 


± 


0.08 


log < >L 


-0.01 ± 0.15 


-0.01 ± 0.09 


0.00 


± 


0.08 


0.00 


± 


0.06 


0.00 


± 


0.05 


log < Zi, >M 


-0.03 ± 0.18 


-0.02 ± 0.13 


-0.03 


± 


0.11 


-0.02 


± 


0.09 


-0.03 


± 


0.08 


o-i(log t*) 


0.03 ± 0.16 


0.00 ± 0.10 


0.00 


± 


0.08 


-0.01 


± 


0.07 


-0.01 


± 


0.06 


crA/(log t*) 


-0.04 ± 0.09 


-0.02 ± 0.06 


-0.01 


± 


0.05 


-0.01 


± 


0.04 


0.00 


± 


0.04 




-0.01 ± 0.21 


0.00 ± 0.14 


0.02 


± 


0.11 


0.02 


± 


0.10 


0.02 


± 


0.09 




-0.11 ± 0.26 


-0.05 ± 0.19 


-0.03 


± 


0.16 


-0.02 


± 


0.14 


-0.02 


± 


0.12 




0.95 ± 0.02 


0.95 ± 0.02 


0.95 


± 


0.02 


0.95 


± 


0.02 


0.95 


± 


0.02 


Aa 


11.64 ± 4.05 


5.37 ± 1.19 


3.54 


± 


0.78 


2.65 


± 


0.58 


1.76 


± 


0.39 


Aa X (5/iV)A„ 


0.78 ± 0.27 


0.68 ± 0.20 


0.64 


± 


0.21 


0.59 


± 


0.22 


0.51 


± 


0.23 



Table 1. Each line corresponds to a parameter constructed by combining the parameters given by the synthesis. The different columns 
list the mean ± rms difference between output and input values of the corresponding quantity, as obtained from simulations with 
different signal-to-noise ratios. For instance, the mean output minus input difference in < logt* >i for S/N = 10 is 0.01 dex and the 
rms dispersion is 0.08 dex. The last 3 lines describe the statistics of goodness-of-fit indicators. The units arc percentage for light and 
mass fractions (xy ■ ■ .Mo)i mag for Ay, kms~^ for and o"*, percentage for A a and dex for logarithmic quantities. 



(TL (log t-i 



.j=i 



{\ogtj~ < logU >l) 



1/2 



(4) 



measures the flux-weighted standard deviation of the log 
age distribution, and might be useful to distinguish galaxies 
dominated by a single population from those which had con- 
tinuous or bursty star-formation histories. The uncertainty 
in this index is of order 0.1 dex. 



2.2.5 Mean stellar metallicity 

Given an option of what to chose as a second parameter to 
describe a mixed stellar population, the choice would likely 
be its typical metallicity. Analogously to what we did for 
ages, we define both light and mass-weighted mean stellar 
metallicities: 



and 



J = l 



< >M=y^jijZj 



(5) 



(6) 



both of which are bounded by the 0.2-2.5 Z@ base limits. 
Fig. |S] and Table Q show that the rms of A log < >m = 
log < >A/, output — log < Zi, >Af,input is of Order 0.1 dex. 
In absolute terms this is comparable to A < logt* >, but 
note that < > covers a much narrower dynamical range 
than < logt* >, so that in practice mean stellar metallic- 
ities are more sensitive to errors than mean ages. This is 



not surprising, given that age is the main driver of variance 
among SSP spectra, metallicity having a "second-order" ef- 
fect (e.g., Schmidt etal. 1991; Ronen, Aragon-Salamanca & 
Lahav 1999). This is the reason why studies of the stellar 
populations of galaxies have a much harder time estimating 
metallicities than ages, to the point that one is often forced 
to bin-over the Z information and deal only with age-related 
estimates such as < logt, > (e.g., Cid Fernandes etal. 2001; 
Cid Fernandes, Leao & Rodrigues Lacerda 2003; K03). 

Notwithstanding these notes, it is clear that uncertain- 
ties of ^ 0.1 dex in < > are actually good news, since 
they do allow us to recover useful information on an impor- 
tant but hard to measure property. This new tracer of stel- 
lar metallicity is best applicable to large samples of galaxies 
such as the SDSS. The statistics of samples help reducing 
uncertainties associated with < > estimates for single 
objects and allows one to investigate correlations between 
< > and other galaxy properties (Sodre et al. , in prepa- 
ration; Section f5.il . 



2.2.6 Age-Metallicity degeneracy 

Our method tends to underestimate < > for metal- 
rich systems and vice-versa. This bias is due to the infa- 
mous age-metallicity degeneracy. In order to verify to which 
degree our synthesis is affected by this well known prob- 
lem (e.g., Renzini & Buzzoni 1986; Worthey 1994; Bressan, 
Chiosi & Tantalo 1996) we have examined the correlation be- 
tween the output minus input residuals in < logt* >l and 
log < Zf, >L. The age-Z degeneracy acts in the sense of con- 
fusing old, metal-poor systems with young, metal-rich ones 
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Figure 5. Input versus output mean stellar ages and metallicities for simulations with S/N = 10 and 20. 



and vice-versa, which should produce anti-correlated resid- 
uals. These are indeed seen in figure|S] where small symbols 
represent all 1300 individual simulations and filled circles 
correspond to the mean residuals obtained for each set of 20 
perturbations of the 65 test galaxies. This anti-correlation is 
also present with the mass- weighted mean stellar metallicity 
< Z-i, >M, but is not as strong as for < Zi, >l- On the other 
hand, the uncertainty in < > m is always larger than for 
<Z^>L (Table 0. 

The age-Z degeneracy is thus present in our method, 
introducing systematic biases in our < Zi, > and < log > 
estimates at the level of up to ~ 0.1-0.2 dex. None of the 
results reported in this paper rely on this level of precision. 



2.3 Simulations for random parameters 

We have also carried out a separate set of simulations for 100 
test galaxies generated by random combinations of x, Ay, 
cr*. These simulations differ from the ones presented above 
in that the test galaxies are not restricted to match the range 
of properties inferred from the application of the synthesis to 
SDSS galaxies (Section]^. Given that these random galaxies 
span a broader (but less representative of our sample galax- 
ies) region of the parameter space, one might expect to find 



larger uncertainties in the derived properties. This was in- 
deed confirmed in the numerical experiments, although the 
effect is small. The uncertainties in the {xy ,xi ,xo) binned 
fractions increase by just a couple of percentage points with 
respect to those in Table Q Similarly, the increase in /S.Av 
is of just 0.02 mag, while A < logt* >l and A log < >m 
increase by ~ 0.02 and 0.03 dex respectively. 

The only properties whose uncertainties are substan- 
tially larger that those reported in Table are the stellar 
mass and velocity dispersion. For instance, while the SDSS- 
based simulations for S/N = 10 yield A log M^, = 0.08 dex 
and Act* = 12 kms~^, with random galaxies these errors 
increase to 0.15 dex and 24 kms"'^ respectively. The rea- 
son for this apparent discrepancy is due to the fact that the 
set of random test-galaxies contains a larger proportion of 
systems dominated by very young stellar populations than 
we find for SDSS galaxies. An error Axy in the light frac- 
tion associated to these populations must be compensated 
by errors in the older components, which carry most of the 
mass, even when xy is large. As illustrated by the size of 
the error-bars in figure |1] /S.xy increases as xy increases, 
so the errors in the mass fractions components increase too, 
leading to larger dispersion in Kh. Furthermore, galaxies 
with large xy have few absorption features to constrain the 
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Figure 6. Output minus input residuals in log < >i against residuals in < logt* >i. Small circles are the individual test galaxies. 
Large filled circles mark the mean residuals for 20 perturbed versions of the same test galaxy. The anti-correlation illustrates the cflfect 
of the age-metallicity degeneracy. 



kinematical broadening of the spectrum, which explains the 
larger dispersion in our ct* estimates. To prove this point, 
we have re-evaluated the uncertainties in and ct* ex- 
cluding test galaxies with xy > 70%, which corresponds 
to systems which formed > 20% of their stellar mass over 
the past < 10® yr. For this subset of the simulations, which 
comprise 72 galaxies (each one split onto 20 different spec- 
tra corresponding to independent Monte Carlo realizations 
of the noise), the uncertainties in mass and velocity disper- 
sion decrease to AlogAf* = 0.09 dex and Act* — 14 kms^^ 
(for S/N = 10), only slightly larger than those reported in 
Section r2.2.3l Uncertainties in other properties also decrease 
to values very similar to those listed in Tabled 

We thus conclude that the parameter uncertainties 
studied in Section [2.21 and summarized in Tableware only 
moderately affected by the design of the simulations, and 
represent fair estimates of the limitations of our synthesis 
method. 

Finally, we have carried out simulations using the Z = 
0.02^0 BC03 SSPs to generate further test galaxies. Galax- 
ies with such low metallicity are not expected to be present 
in significant numbers in the sample described in Section 
13.11 given that it excludes low luminosity systems like HII 
galaxies and dwarf ellipticals (which are also the least metal- 
lic ones by virtue of the mass-metallicity relation). Still, it 
is interesting to investigate what would happen in this case. 
When synthesized with our 0.2-2.5^© base (Fig. 0, these 
extremely metal poor galaxies are modeled predominantly 
with the Q.2Zq components, as intuitively expected. More- 
over, the mismatch in metallicity introduces non-negligible 
biases in other properties, like masses, mean ages and ex- 
tinction {Mi,, for instance, is systematically underestimated 
by 0.3 dex). Similar problems should be encountered when 
modeling systems with Z > 2.5^©. These results serve as 



a reminder that our base spans a wide but finite range in 
stellar metallicity, and that extrapolating these limits has 
an impact on the derived physical properties. While there 
is no straightforward a priori diagnostic of which galaxies 
violate these limits, in general, one should be suspicious of 
objects with mean too close to the base limits. 

2.4 Summary of the simulations 

Summarizing this theoretical study, we have performed sim- 
ulations designed to evaluate the accuracy of our spectral 
synthesis method. The simulations mimic in as much as pos- 
sible the wavelength range, spectral resolution, error spec- 
trum, and S/N of the actual SDSS data studied below. Sev- 
eral physically motivated combinations of the synthesis pa- 
rameters were investigated to establish their precision at dif- 
ferent S/N. Table summarizes the uncertainties in these 
quantities and a few additional ones not explicitly mentioned 
above. In what follows we focus on 5 parameters: Stellar 
mass, velocity dispersion, extinction, mean stellar ages and 
metallicities, all of which were found to be adequately re- 
covered. 

This exercise demonstrates that we are capable of pro- 
ducing reliable estimates of several parameters of astrophys- 
ical interest, at least in principle. We must nevertheless em- 
phasize that this conclusion relies entirely on models and on 
an admittedly simplistic view of galaxies. When applying 
the synthesis to real galaxy spectra, a series of other effects 
come into play. For instance, the extinction law appropri- 
ate for each galaxy likely differs from the one used here: 
in our Galaxy, the ratio of total to selective extinction of 
stellar sources, Rv = Av/E{B — V), is known to depend 
on the line of sight (Cardelli et al. 1989; Patriarchi et al. 
2001); in addition, the wavelength dependence of the atten- 
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uation of light from an extended source such as a galaxy 
includes the effects of scattering back into the light beam, 
and depends on the relative distribution of stars and dust 
(Witt etal. 1992; Gordon etal. 1997). Also, one might ex- 
pect that each population of stars is affected by a distinct 
extinction (Panuzzo etal. 2003; Chariot & Fall 2000). Simi- 
larly, while in the evolutionary tracks adopted here the metal 
abundances are scaled from the solar values, non-solar abun- 
dance mixtures are known to occur in stellar systems (e.g., 
Trager et al. 2000a, b) , not to mention uncertainties in the 
SSP models and the always present issue of the IMF. In 
short, evidence against these simplifying hypothese abound. 

Accounting for all these effects in a consistent way is not 
currently feasible. We mention these caveats not to dismiss 
simple models, but to highlight that all parameter uncer- 
tainties discussed above are applicable within the scope of 
the model. Hence, while the simulations lend confidence to 
the synthesis method, one might remain skeptical of its ac- 
tual power. The next sections further address the reliability 
of the synthesis, this time from a more empirical perspective. 



3 ANALYSIS OF A VOLUME-LIMITED 
GALAXY SAMPLE 

In this section we apply our synthesis method to a large 
sample of SDSS galaxies to estimate their stellar population 
properties. We also present measurements of emission line 
properties, obtained from the observed minus synthetic spec- 
tra. The information provided by the synthesis of so many 
galaxies allows one to address a long menu of astrophysi- 
cal issues related to galaxy formation and evolution. Before 
venturing in the exploration of such issues, however, it is im- 
portant to validate the results of the synthesis by as many 
means as possible. Hence, the goal of the study presented 
below is not so much to explore the physics of galaxies but 
to provide an empirical test of our synthesis method. The 
results reported in this section are used in Sections |1] and El 
with this purpose. 

3.1 Sample definition 

The spectroscopic data used in this work were taken from 
the SDSS. This survey provides spectra of objects in a large 
wavelength range (3800-9200 A) with mean spectral resolu- 
tion A/AA ~ 1800, taken with 3 arcsec diameter fibers. The 
most relevant characteristic of this survey for our study is 
the enormous amount of good quality, homogeneously ob- 
tained spectra. The data analyzed here were extracted from 
the SDSS main galaxy sample available in the Data Release 
2 (DR2; Abazajian et al. 2004). This flux-limited sample con- 
sists of galaxies with reddening-corrected Petrosian r-band 
magnitudes r < 17.77, and Petrosian r-band half-light sur- 
face brightnesses /iso < 24.5 mag arcsec^^ (Strauss et al. 
2002). 

From the main sample, we first selected spectra with 
a redshift confidence > 0.35. Following the conclusions of 
Zaritsky, Zabludoff, & Willick (1995), we have imposed a 
redshift limit of 2 > 0.05 (trying to avoid aperture effects 
and biases; see e.g. Gomez et al. 2003) and selected a volume 
limited sample up to z — 0.1, corresponding to a r-band 
absolute magnitude limit of A-I{r) — —20.5. The absolute 



magnitudes used here are k-corrected with the help of the 
code provided by Blanton et al. (2003; kcorrect v3_2) and 
assuming the following cosmological parameters; Ho ~ 70 
km s"'^ Mpc~^, fii./ — 0.3 and Qa = 0.7. We also restricted 
our sample to objects for which the observed spectra show 
a S/N ratio in g, r and i bands greater than 5. These re- 
strictions leave us with a volume limited sample containing 
50362 galaxies, which leads to a completeness level of ~ 98.5 
per cent. 



3.2 Results of the spectral synthesis 

All 50362 spectra were brought to the rest-frame (using the 
redshifts in the SDSS database), sampled from 3650 to 8000 
A in steps of 1 A, corrected for Galactic extinction'^ using 
the maps given by Schlegel, Finkbeiner & Davis (1998) and 
the extinction law of Cardelli etal. (1989, with Ry = 3.1), 
and normalized by the median fiux in the 4010-4060 A re- 
gion. The S/N ratio in this spectral window spans the 5-30 
range, with median value of 14. Besides the masks around 
the lines listed in Section r2.2.1l we exclude points with SDSS 
flag > 2, which signals bad pixels, sky residuals and other 
artifacts. After this pre-processing, the spectra are fed into 
the STARLIGHT code described in Section |TT] On aver- 
age, the synthesis is performed with N\ = 3677 points, af- 
ter discounting the ones which are clipped by our < 3 sigma 
threshold (typically 40 points) and the masked ones. 

The spectral fits are generally very good, as illustrated 
in Figs. Handel The mean value of x^/^x is 0.78. In fact, 
this is somewhat too good, since from the simulations we ex- 
pect x'/Nx ~ 0.95. This is a minor difference, which could 
be fixed decreasing the errors in Ox by ~ 10%. We further 
quantify the quality of the fits by Ax, the mean value of 
\Ox — Mx\/Ox over all non-masked points. From the sim- 
ulations we expect this alternative figure of merit to be of 
order of 0.6 times the noise-to-signal ratio at Ao (Table 0. 
This is exactly the mean value of Aa x 5'/A'^ in the actual 
fits, again indicating acceptable fits. 

The total stellar masses of the galaxies were obtained 
from the stellar masses derived from the spectral synthe- 
sis (which correspond to the light entering the fibers) by 
dividing them by (1 — /), where / is the fraction of the to- 
tal galaxy luminosity in the z-band outside the fiber. This 
approach, which neglects stellar population and extinction 
gradients, leads to an increase of typically 0.5 in log M*. We 
did not apply any correction to the velocity dispersion esti- 
mated by the code given that the spectral resolution of the 
BC03 models and the data are very similar. 

We point out that we did not constrain the extinction 
Av to be positive. There are several reasons for this choice: 
(a) some objects may be excessively dereddened by Galac- 
tic extinction; (b) some objects may indeed require bluer 
SSP spectra than those in the base; (c) the observed light 
may contain a scattered component, which would induce 
a bluening of the spectra not taken into account by the 
adopted pure extinction law; (d) constraining Ay to have 
only positive values produces an artificial concentration of 



^ Unlike in the first data release, the final calibrated spectra from 
the DR2 arc not corrected for foreground Galactic reddening. 
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solutions at Ay = 0, an unpleasant feature in the Ay distri- 
bution. Interestingly, most of the objects for which we de- 
rive negative Ay (typically —0.1 to —0.3 mag) are early-type 
galaxies. These galaxies are dominated by old populations, 
and expected to contain little dust. This is consistent with 
the result of K03, who find negative extinction primarily in 
galaxies with a large -Dn(4000). The distribution of Ay for 
these objects, which can be selected on the basis of spec- 
tral or morphological properties, is strongly peaked around 
Ay = 0, so that objects with Ay < can be considered as 
consistent with having zero extinction. In any case, none of 
the results reported in this paper is significantly affected by 
this choice. 



3.3 Emission line measurements 

Besides providing estimates of stellar population properties, 
the synthesis models allow the measurement of emission 
lines from the "pure-emission" , starlight subtracted spectra 
(Oa -Ma). We have measured the lines of [O ll] AA3726,3729, 
[O iii]A4363, H/3, [O iii]AA4959,5007, [O i]A6300, [N ii]A6548, 
Hq, [N ii]A6584 and [S n]AA6717,6731. Each line was treated 
as a Gaussian with three parameters: width, offset (with re- 
spect to the rest-frame central wavelength), and flux. Lines 
from the same ion were assumed to have the same width and 
offset. We have further imposed [O iii]A5007/[O iii]A4959 = 
2.97 and [N ll]A6584/[N n]A6548 = 3 flux ratio constraints. 
Finally, we consider a line to have signiflcant emission if its 
fit presents a S/N ratio greater than 3. 

In some of the following analysis, galaxies with emis- 
sion lines are classified according to their position in the 
[O in]/H/3 versus [N iiJ/Hq diagram proposed by Bald- 
win, Phillips & Terlevich (1981) to distinguish normal star- 
forming galaxies from galaxies containing active galactic nu- 
clei (AGN) . We define as normal star-forming galaxies those 
galaxies that appear in this diagram and are below the curve 
defined by K03 (see also Brinchmann et al. 2004) . Objects 
above this curve are transition objects and galaxies contain- 
ing AGN. 



3.4 Aperture bias 

A point that deserves mention here is the bias that may be 
introduced in the analysis due to the use of small fibers to 
measure the galaxy spectra. This effect, known as aperture 
bias, may introduce a redshift dependence in the measured 
galajcy spectra, since the fraction of galaxy light received 
by a fiber increases with increasing distance. Zaritsky etal. 
(1995), based on an analysis of spectra from the Las Cam- 
panas Redshift Survey, find that a lower limit on redshifts 
of z = 0.05 minimizes the aperture bias. This effect has also 
been discussed by K03, Gomez et al. (2003) and Tremonti 
et al. (2004) for SDSS spectra. For instance, these authors 
found that the galaxy z-band M/L ratio and the gas-phase 
oxygen abundance decrease by ~ 0.1 dex over the redshift 
range of the survey, indicating that these quantities are mod- 
erately affected by aperture bias. The redshift range consid- 
ered in our study is smaller than that of the entire survey, 
implying that the aperture effects are expected to be even 
smaller in our sample. 

In order to verify whether our results are significantly 



affected by this bias, we investigated the behaviour of some 
of the quantities resulting from our analysis {Mi,, < >, 
< logt* > and Ay) as a function of redshift for galaxies 
with luminosities above and below M{r) = —21.17, the me- 
dian luminosity of our sample. We divided the sample in 
several redshift bins containing the same number of objects, 
and computed the median value and the quartiles of each 
quantity in each bin. Fig. |7| shows as solid lines the median 
values of each distribution, as well as their respective quar- 
tiles. None of the quantities seems to be strongly affected by 
this bias. The largest correlation with z appears for light- 
weighted ages, corresponding to a change along our redshift 
distribution of about 0.1 and 0.26 dex for faint and bright 
galaxies, respectively. This is a plausible result since the light 
of nearby objects seen by the fiber aperture is dominated by 
the older stellar populations of their bulges. For the other 
quantities the variations are well below 0.1 dex. 



4 COMPARISONS WITH THE MPA/JHU 
DATABASE 

The SDSS database has been explored by several groups, 
using different approaches and techniques. The MPA/JHU 
group has recently publicly released catalogues^ of derived 
physical properties for 211894 SDSS galaxies, including 
33589 narrow-line AGN (K03, see also Brinchmann et al. 
2004). These catalogues are based on the K03 method to 
infer the star formation histories, dust attenuation and stel- 
lar masses of galaxies from the simultaneous analysis of the 
4000 A break strength, D„(4000), and the Balmer line ab- 
sorption index HSa- These two indices are used to constrain 
the mean stellar ages of galaxies and the fractional stellar 
mass formed in bursts over the past few Gyr, and a compar- 
ison with broad-band photometry then allows to estimate 
the extinction and stellar masses. 

The MPA/JHU catalogues provide very useful bench- 
marks for similar studies. In this section we compare the 
values of some of the parameters from these catalogues with 
our own estimates. Catalogues of galaxy properties obtained 
with our synthesis method will be made available in due 
course. Besides comparing directly measurements of phys- 
ical quantities, our aim is also to highlight the differences 
that may appear due to the use of different methods and 
procedures. 

4.1 Stellar extinction 

The MPA/JHU group estimates the z-band stellar extinc- 
tion Az through the difference between model and mea- 
sured colours, assuming an attenuation curve proportional 
to X-" ''. In our case, the extinction Ay is derived directly 
from the spectral fitting, carried out with the Milky Way 
extinction law (Cardelli etal. 1989, c.f. Section 12.11 . for 
which Az = 0.4849y4v assuming A^ = 8931 A. Fig. m shows 
that these two independent estimates are very strongly 
and linearly correlated, with a Spearman rank correla- 
tion coefficient rs ~ 0.95. However, the values of A^ re- 
ported by the MPA/JHU group (column 17 of their Stellar 

^ available at|http://www.mpa-garching.mpg.de/SDSS/| 
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Figure 7. Trends of median values (and respective quartiles) of the stellar mass, mean stellar metallicity, mean stellar age and the 
V-band extinction as a function of redshift. Results are shown for galaxies with luminosities above ("Bright", top panels) and below 
("Faint", bottom panels) the median luminosity of the sample. In this and following figures the gray scale level represents the number 
of galaxies in each pixel, darker pixels having more galaxies; the upper number in the top right of each panel indicates the total number 
of galaxies in the plot, and the lower one is the Spearman rank correlation coefficient (rg). 
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Figure 8. Comparison of the stellar extinction in the z-band 
determined by the MPA/JHU group with that estimated though 
the method described in this paper. The solid line shows ay = ax 
linear fit to the data, while the dash ones is the identity line 

{y = x)- 



Mass Catalogue) are systematically larger than our values: 
A^MPA/JHU) ~ 2.51yl4This Work) in the median. 

This discrepancy is only apparent. The Galactic extinc- 
tion law is substantially harder than A""'^. One thus expects 
to need less extinction when modeling a given galaxy with 
the former law than with the latter. This was confirmed 



by STARLIGHT fits to a sub-set of SDSS galaxies using 
a A^'^ '^-law, which yield a value of Av typically 1.77 times 
larger than those obtained with the Cardelli et al. (1989) law. 
Since the A^-fAy conversion factors are 0.7030 and 0.4849 
for the A"" '^ and Cardelli etal. laws, one finds that the A^. 
values obtained for the two laws should differ by a factor of 
1.77 X 0.7030/0.4849 = 2.57. This is very close to the empir- 
ically derived factor of 2.51 (Fig.jHJ. We thus conclude that 
there are no substantial differences between the MPA/JHU 
and our estimates of the stellar extinction other than those 
implied by the differences in the reddening laws adopted in 
the two studies. 



4.2 Stellar masses 

Fig. compares our results for the total stellar masses to 
the MPA/JHU extinction-corrected stellar masses (column 
9 of their Stellar Mass Catalogue) . The two estimates of Af* 
correlate very well, with rg = 0.89. The quantitative agree- 
ment is also good, with a median difference of just 0.1 dex. 
This small offset cannot be attributed to the different IMFs 
employed in the two studies (Chabrier 2003 here and Kroupa 
2001 in K03), since, as illustrated in figure 4 of BC03, these 
two IMFs yield practically identical Af/L ratios. Instead, 
this offset seems to be due to a subtle technicality. Whereas 
we adopt the M/L ratio of the best model, the MPA/JHU 
group derives M/L comparing the observed values of the 
-Dn (4000) and HSa indices with a library of 32000 models. 
Each model is then weighted by its likelihood, and a prob- 
ability distribution for M/L is computed. The MPA/JHU 
mass is the median of this distribution, which is not nec- 
essarily the same as the best-x^ value. In fact, the Stellar 
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Figure 9. (a) Comparison of tlie stellar mass determined by the MPA/JHU group with our estimate. The solid line shows a y = ax + b 
linear fit to the data, while the dash ones is the identity line {y = x). (b) Relation of the ratio between our and the MPA/JHU masses 
and our estimate of the V-band stellar extinction, (c) Comparison of the stellar masses after correcting for differences induced by the 
different extinction laws. 



Mass Catalogue of Brinchmann et al. (2004) lists in its col- 
umn 8 the best masses, which are systematically larger 
than the median ones by ~ 0.1 dex, identical to the median 
difference identified above. 

Part of the scatter in Fig. can be attributed 
to the different extinction laws. Extinction contributes 
+0.4Aao to the estimated logAf*. As discussed above, 
using a A^'^'^ law in the synthesis yields values of Ay 
which are 1.77 times larger than using the Cardelli 
et al. (1989) law. Furthermore, the mass-to-light ratios ob- 
tained with the two laws are very similar. From this 
we expect that log (This Work) /M* (MPA/JHU) ^ 
— 0.3Av(This Work), in good agreement with the observed 
relation (Fig.l^t). Correcting for this effect by adding O.SAv 
to our mass estimates indeed produces a better correlation, 
with rs ~ 0.95, as illustrated in Fig. EJ:. 

Overall, we conclude that the two mass estimates agree 
to within 0.4 dex. This level of agreement is similar to that 
recently found by Drory, Bender & Hopp (2004) in their 
comparison between the MPA/JHU masses and estimates 
based on SDSS plus 2MASS photometry. It is quite remark- 
able that, despite the substantial differences between our 
approaches and the underlying assumptions, the estimated 
stellar masses are so similar over a wide range of masses. 
On the other hand, this may not be so surprising given that 
the MPA/JHU group estimates of physical properties are 
ultimately based on a comparison of observed indices with 
an extensive library of galaxy spectra constructed out of 
the BC03 evolutionary synthesis models (K03). Since this 
library spans a wide of metallicities and star-formation his- 
tories, the agreement between our estimates and those of 
the MPA/JHU group may be simply indicating that the 
A'* — 45 SSPs from BC03 used in our spectral synthesis 
span a similar parameter-space to that covered by the K03 
models. 



4.3 Velocity dispersion 

We use the sub-sample of galaxies with active nuclei to com- 
pare our measurements of absorption line broadening due to 
galaxy velocity dispersion and/or rotation, a*, with those of 
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Figure 10. Comparison of the velocity dispersions (a) and oxy- 
gon abundances (b) determined by the MPA/JHU group with our 
estimates. The solid line shows s.y = ax-\-h linear fit to the data, 
while the dash ones is the identity line [\j = x). 



the MPA/JHU group, since they list this quantity only in 
their AGN Catalogue (in column 16). The comparison is 
displayed in Fig. llOb . The Spearman correlation-coefficient 
in this case is rs ~ 0.91 and the median of the difference 
between the two estimates is just 9 kms~^, indicating an 
excellent agreement between both studies. 



4.4 Emission lines and nebular metallicities 

Brinchmann et al. (2004) also provide, in their Emission line 
Catalogue, emission lines fluxes and equivalent widths which 
can be compared with our own measurements. In both stud- 
ies the emission lines are measured after subtracting from 
the observed spectrum a model spectrum representing the 
stellar emission. In our case this is done with our synthesized 
spectra. The MPA/JHU group adopts a similar approach 
(see Tremonti et al. 2004 for a brief description) by fltting the 
observed continuum with BC03 models. They adopt, how- 
ever a single metallicity model and a different extinction law. 
We have compared the fluxes and equivalent widths of Ha, 
[N ii]A6584, [O n]A3727, H/3 and [O ni]A5007 as measured 
by our code and that obtained by the MPA/JHU group. 
We do not find any significant difference between these val- 
ues; the largest discrepancy (~ 5 per cent) was found for the 
equivalent widths of Ha and [N ll], probably due to different 
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estimates of the continuum level and the associated underly- 
ing stellar absorption. Our emission line measurements are 
also in good agreement with those in Stasihska et al. (2004) , 
who fit the Balmer lines with emission and absorption com- 
ponents, instead of subtracting a starlight model. 

In Fig. llOb we plot our estimates of the nebular oxygen 
abundance against those obtained by the MPA/JHU group 
(Catalogue of Gas Phase Metallicities, median values; see 
Tremonti et al. 2004) . In order not to introduce any bias 
due to the use of different indicators of the oxygen abun- 
dance, we have estimated O/H using the calibration of the 
([O ll]A3726,3729 + [O lll]A4959,5007)/ H/3 ratio given by 
Tremonti etal. (2004) in their equation (1). This calibration 
is based on simultaneous fits of the most prominent emis- 
sion lines with a model designed for the interpretation of 
integrated galaxy spectra (Chariot & Longhetti 2001). The 
oxygen abundances estimated in such a way differ by just 
~ 0.05 dex, as shown in Fig. llOb . 

Overall, we conclude that our spectral synthesis method 
yields estimates of physical parameters in good agreement 
with those obtained by the MPA/JHU group, considering 
the important differences in approach and underlying as- 
sumptions. 
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Figure 11. Nebular oxygen abundance versus mass-weighted 
stellar metallicity, both in solar units, for normal star-forming 
galaxies in our sample. The median values and quartiles in bins 
of same number of objects are shown as thin solid lines. The 
dashed line is a robust fit for the relation. 



5 EMPIRICAL RELATIONS 

Yet another way to assess the validity of physical properties 
derived through a spectral synthesis analysis is to investi- 
gate whether this method yields astrophysically reasonable 
results. In this section we follow this empirical line of reason- 
ing by comparing some results obtained from our synthesis 
of SDSS galaxies (which excludes emission lines) with those 
obtained from a direct analysis of the emission lines. Our 
aim is to demonstrate that our synthesis results do indeed 
make sense. A more detailed discussion of most of the points 
below will be presented in other papers of this series. 



5.1 Stellar and Nebular Metallicities 

Our spectral synthesis approach yields estimates of the mean 
metallicity of the stars in a galaxy, < >. The analysis 
of emission lines, on the other hand, gives estimates of the 
present-day abundances in the warm interstellar medium. 
Although stellar and nebular metallicities are not expected 
to be equal, it is reasonable to expect that they should 
roughly scale with each other. 

Fig. 111! shows the correlation between mass-weighted 
stellar metallicities and the nebular oxygen abundance (com- 
puted as in Section r4.4ll . both in solar units*, for our sample 
of normal star-forming galaxies. A correlation is clearly seen, 
although with large scatter {rs = 0.42). Galaxies with large 
stellar metallicities also have large nebular oxygen abun- 
dances; galaxies with low stellar abundances tend to have 
smaller abundances. The observed scatter is qualitatively 
expected due to variations in enrichment histories among 
galaxies. A robust linear fitting gives the following relation: 



* The solar unit adopted for the nebular oxygen abundance is 
12 -I- log(O/H)0 = 8.69 (AUende Prieto, Lambert & Apslund 
2001). 



log ( '^^^ I = 0.34 + 0.23 log 
0.00049/ ^ 



0.02 



(7) 



with a dispersion of 0.08 dex. Notice that in this expression 
both stellar and nebular metallicities are normalized to solar 
units {Zq = 0.02 and (O/H)© = 0.00049 respectively). 

Nebular and stellar metallicities are estimated through 
completely different and independent methods, so the corre- 
lation depicted in Fig. Illl provides an a posteriori empirical 
validation for the stellar metallicity derived by the spectral 
synthesis. The possibility to estimate stellar metallicities for 
so many galaxies is one of the major virtues of spectral syn- 
thesis, as it opens an important window to study the chem- 
ical evolution of galaxies and of the universe as a whole 
(Panter etal. 2004; Sodre etal. in prep.). 



5.2 Stellar and Nebular extinctions 

The stellar extinction in the V-band is one of the products of 
our STARLIGHT code. A more traditional and completely 
independent method to evaluate the extinction consists of 
comparing the observed Ha/H/3 Balmer decrement to the 
theoretical value. The intrinsic value of _F(Ha) / F(H/3) is not 
very sensitive to the physical conditions of the gas, ranging 
from 3.03 for a gas temperature of 5000 K to 2.74 at 20000 
K (Osterbrock 1989). Adopting a value of 2.86 for this ratio, 
the "Balmer extinction" (Stasihska et al. 2004) is given by 



4 Balmer 01 i 

Ay = 6.31 log 



F(Hq)/F(H/3) 



2.86 



(8) 



where the 6.31 coefficient comes from assuming the Cardelli 
etal. (1989) extinction curve. 

Fig. 1121 presents a comparison between the stellar Ay 
and Af/'™"^"^. These two extinctions are determined in com- 
pletely independent ways, and yet, our results show that 
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Figure 12. Relation between the nebular (A^"'™'"') and stellar 
(Ay) extinctions for our sample of normal star-forming galaxies. 
The solid line is a robust fit for the relation. 
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Figure 13. (a) Equivalent width of Ha versus the light- weighted 
mean stellar age for normal emission line galaxies in our sam- 
ple, (b) Relation between the D4000 index and the light-weighted 
mean stellar age. The solid lines are robust fits for the relations. 

they are closely linked, with rs = 0.61. A linear bisector 
fitting yields 

^Balmer ^ q 34 + 1.81.4^ (9) 

Note that the angular coefficient in this relation indicates 
that nebular photons are roughly twice as extincted as the 
starlight. This "differential extinction" is in very good qual- 
itative and quantitative agreement with empirical studies 
(Fanelli et al. 1988; Calzetti, Kinney & Storchi-Bergmann 
1994; Gordon etal. 1997; Mas-Hesse & Kunth 1999). We 
shall explore the implications of this result for the intrin- 
sic colours of star-forming galaxies in another paper of this 
series (Stasihska etal. , in prep.). 

5.3 Relations with mean stellar age 

The equivalent width (EW) of Ha is related to the ratio of 
present to past star formation rate of a galaxy (e.g., Ken- 
nicutt 1998). It is thus expected to be smaller for older 
gala^xies. Fig. I13h , shows the relation between EW(Ha) and 
the mean light-weighted stellar age obtained by our spectral 



synthesis. The anti-correlation, which has rs = —0.78, is 
evident. From this plot we can derive an empirical relation 
which can be used to estimate < logt* >l (for Ao = 4020 
A) of star-forming galaxies through the measurement of 
EW(Ha): 

< logt* >L= 10.10 - 1.30 log EW(_H"a) (10) 

for tt in yr and EW(Ha) in A. It is worth stressing that 
these two quantities are obtained independently, since the 
spectral synthesis does not include emission lines. 

Another quantity that is considered a good age indi- 
cator, even for galaxies without emission lines, is the 4000 
A break, D4000. We measured this index following Bruzual 
(1983), who define D4000 as the ratio between the aver- 
age value of in the 4050-4250 and 3750-3950 A bands. 
The relation between < logt* >l and D4000 is shown in 
Fig. 113b . Note that the concentration of points at the high 
age end reflects the upper age limit of the base adopted here, 
13 Gyr (c.f. Section |2. 2.111 . The correlation is very strong 
(rs = 0.94), showing that indeed D4000 can be used to es- 
timate empirically mean light-weighted galaxy ages, despite 
its metallicity dependence for very old stellar populations 
(older than 1 Gyr, as shown by K03). From the tight rela- 
tion between < log >z, and D4000 we derive the following 
empirical relation 

< log >L= 6.76 + 9.41 log D4000 (11) 

where is in yr. This robust fit reproduces < logt* >!, to 
within an rms dispersion of 0.15 dex. 

5.4 M*-CTt relation 

Fig. 1141 shows the relation between stellar mass and ct*, ob- 
tained from the synthesis. The relation is quite good, with 
rs = 0.79. The solid line displayed in the figure is 

log Ah = 6.44 + 2.04 log a* (12) 

for in Mq and a* in kms^^, obtained with a bisector 
fitting. The figure also shows as a dashed line a fit assum- 
ing Mi, oc (j^, expected from the virial theorem under the 
(unrealistic) assumption of constant mass surface density. 
In both cases we have excluded from the fit galaxies with 
cr* < 35 kms~^, which corresponds to less than half the 
spectral resolution of both data and models. 

This is another relation that is expected a priori if we 
have in mind the Faber- Jackson relation for ellipticals and 
the Tully-Fisher relation for spirals. For early-type galax- 
ies, (J* is a measure of the central velocity dispersion, which 
is directly linked to the gravitational potential depth, and, 
through the virial theorem, to galactic mass. For late-type 
systems, a* has contributions of isotropic motions in the 
bulges, as well as of the rotation of the disks, and is also ex- 
pected to relate with galactic mass. Another aspect that it 
is interesting to point out in Fig. I14l is that the dispersion in 
the M*-cr* relation decreases as we go from low-luminosity, 
rotation-dominated systems, for which the values of cr* de- 
pend on galaxy inclination and bulge-to-disk ratio, to high- 
luminosity, mostly early-type systems, which obey a much 
more regular (and steeper) relation between ct* and M*. 

This relation, between a quantity that is not directly 
linked to the synthesis, a*, and another one that is a prod- 
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Figure 14. Mass- velocity dispersion relation for our sample. The 
solid line is a robust fit for the data while dashed one is a fitting 
assuming Af* oc cr^. 

uct of our synthesis, M*, is yet another indication that the 
results of our STARLIGHT code do make sense. 



6 SUMMARY 

We have developed and tested a method to fit galaxy spec- 
tra with a combination of spectra of individual simple stel- 
lar populations generated with state-of-the art evolutionary 
synthesis models. The main goal of this investigation was to 
examine the reliability of physical properties derived in this 
way. This goal was pursued by three different means: simu- 
lations, comparison with independent studies, and analysis 
of empirical results. Our main results can be summarized as 
follows: 

(i) Simulations tailored to match the characteristics of 
SDSS spectra show that the individual SSP strengths, en- 
coded in the population vector x, are subjected to large un- 
certainties, but robust results can be obtained by compress- 
ing X into coarser but useful indices. In particular, physically 
motivated indices such as mean stellar ages and metallicities 
are found to be well recovered by spectral synthesis even for 
relatively noisy spectra. Stellar masses, velocity dispersion 
and extinction are also found to be accurately retrieved. 

(ii) We have appUed our STARLIGHT code to a vol- 
ume limited sample of over 50000 galaxies from the SDSS 
Data Release 2. The spectral fits are generally very good, 
and allow accurate measurements of emission lines from the 
starlight subtracted spectrum. Catalogues of physical and 
emission line properties derived for this sample will be made 
available in due course. We also report that work is under- 
way to produce a flexible, user friendly and publicly available 
version of STARLIGHT. 

(iii) We have compared our results to those obtained by 
the MPA/JHU group (K03; Brinchmann et al. 2004) with a 
different method to characterize the stellar populations of 
SDSS galaxies. The stellar extinctions and masses derived 



in these two studies are very strongly correlated. Further- 
more, differences in the values of Ay and M* are found to 
be mostly due to the differences in the model ingredients 
(extinction law). Our estimates of stellar velocity dispersions 
and emission line properties are also in good agreement with 
those of the MPA/JHU group. 

(iv) The confidence in the method is further strengthened 
by several empirical correlations between synthesis results 
and independent quantities. We find strong correlations be- 
tween stellar and nebular metallicites, stellar and nebular 
extinctions, mean stellar age and the equivalent width of 
Hq, mean stellar age and the 4000 A break, stellar mass 
and velocity dispersion. These are all astrophysically rea- 
sonable results, which reinforce the conclusion that spectral 
synthesis is capable of producing reliable estimates of phys- 
ical properties of galaxies. 

Overall, these results validate spectral synthesis as a 
powerful tool to study the history of galaxies. Other papers 
in this series will take advantage of this tool to address issues 
regarding aspects of galaxy formation and evolution. 
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